Jian Liang

WorldCoder-Bench: Benchmarking Physically Grounded 3D World Synthesis

Jun 01, 2026
Via arXiv

VeriTrip: A Verifiable Benchmark for Travel Planning Agents over Unstructured Web Corpora

May 27, 2026
Via arXiv

Counteraction-Aware Multi-Teacher On-Policy Distillation for General Capability Recovery with Domain Preservation

May 26, 2026
Via arXiv

Kwai Summary Attention Technical Report

Apr 27, 2026
Via arXiv

Understanding and Mitigating Spurious Signal Amplification in Test-Time Reinforcement Learning for Math Reasoning

Apr 23, 2026
Via arXiv

What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time

Mar 20, 2026
Via arXiv

Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation

Feb 27, 2026
Via arXiv

How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

Feb 23, 2026
Via arXiv

Mitigating the Safety-utility Trade-off in LLM Alignment via Adaptive Safe Context Learning

Feb 14, 2026
Via arXiv

Stop Tracking Me! Proactive Defense Against Attribute Inference Attack in LLMs

Feb 12, 2026
Via arXiv